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Abstract 

This paper proposes a spatio-temporal decomposition for the detection of moving targets in multiantenna SAR. As a high 
resolution radar imaging modality, SAR detects and localizes non-moving targets accurately, giving it an advantage over lower 
resolution GMTI radars. Moving target detection is more challenging due to target smearing and masking by clutter. Space-time 
adaptive processing (STAP) is often used to remove the stationary clutter and enhance the moving targets. In this work, it is shown 
that the performance of STAP can be improved by modeling the clutter covariance as a space vs. time Kronecker product with 
low rank factors. Based on this model, a low-rank Kronecker product covariance estimation algorithm is proposed, and a novel 
separable clutter cancelation filter based on the Kronecker covariance estimate is introduced. The proposed method provides orders 
of magnitude reduction in the required number of training samples, as well as improved robustness to corruption of the training data. 
Theoretical properties of the proposed estimation algorithm are established showing significant reductions in training complexity 
under the spherically invariant random vector model (SIRV). Finally, an extension of this approach incorporating multipass data 
(change detection) is presented. Simulation results and experiments using the Gotcha SAR GMTI challenge dataset are presented 
that confirm the advantages of our approach relative to existing techniques. 


I. Introduction 


T He detection (and tracking) of moving objects is an important task for scene understanding, as motion often indicates 
human related activity |29l . Radar sensors are uniquely suited for this task, as object motion can be discriminated via the 
Doppler effect. In this work, we propose a spatio-temporal decomposition method of detecting ground based moving objects 
in airborne Synthetic Aperture Radar (SAR) imagery, also known as SAR GMTI (SAR Ground Moving Target Indication). 

Radar moving target detection modalities include MTI radars EH, DU, which use a low carrier frequency and high pulse 
repetition frequency to directly detect Doppler shifts. This approach has significant disadvantages, however, including low 
spatial resolution, small imaging field of view, and the inability to detect stationary or slowly moving targets. The latter 
deficiency means that objects that move, stop, and then move are often lost by a tracker. 

SAR, on the other hand, typically has extremely high spatial resolution and can be used to image very large areas, e.g. 
multiple square miles in the Gotcha data collection lt34ll . As a result, stationary and slowly moving objects are easily detected 
and located DU, E2). Doppler, however, causes smearing and azimuth displacement of moving objects j25l . making them 
difficult to detect when surrounded by stationary clutter. Increasing the number of pulses (integration time) simply increases 
the amount of smearing instead of improving detectability li25l . Several methods have thus been developed for detecting and 
potentially refocusing moving targets in clutter. Our goal is to remove the disadvantages of MTI and SAR by combining their 
strengths (the ability to detect Doppler shifts and high spatial resolution) using space time adaptive processing (STAP) with a 
novel Kronecker product spatio-temporal covariance model, as explained below. 

SAR systems can either be single channel (standard single antenna system) or multichannel. Standard approaches for the single 
channel scenario include autofocusing DU and velocity filters. Autofocusing works only in low clutter, however, since it may 
focus the clutter instead of the moving target M, E2- Velocity filterbank approaches used in track-before-detect processing 
li25l involve searching over a large velocity/acceleration space, which often makes computational complexity excessively high. 
Attempts to reduce the computational complexity have been proposed, e.g. via compressive sensing based dictionary approaches 
ll26l and Bayesian inference ll29l . but remain computationally intensive. 

Multichannel SAR have the potential for greatly improved moving target detection performance DU, EH, DU- Standard 
multiple channel configurations include spatially separated arrays of antennas, flying multiple passes (change detection), using 
multiple polarizations, or combinations thereof |29l . Disadvantages to these approaches include the higher data rate created 
by collecting multiple channels and the fact that multiple passes involve long delays, registration issues, and having to fly the 
same orbit more than once (29). 


A. Previous Multichannel Approaches 

Several techniques exist for using multiple radar channels (antennas) to separate the moving targets from the stationary back¬ 
ground. SAR GMTI systems have an antenna configuration such that each antenna transmits and receives from approximately 
the same location but at slightly different times E3, DU, EH- Along track interferometry (ATI) and displaced phase center 
array (DPCA) are two classical approaches l29l for detecting moving targets in SAR GMTI data, both of which are applicable 
only to the two channel scenario. Both ATI and DPCA first form two SAR images, each image formed using the signal 
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from one of the antennas. To detect the moving targets, ATI thresholds the phase difference between the images and DPCA 
thresholds the magnitude of the difference. A Bayesian approach using a parametric cross channel covariance generalizing 
ATI/DPCA to p channels was developed in |29| . Space-time Adaptive Processing (STAP) learns a spatio-temporal covariance 
from clutter training data, and uses these correlations to filter out the stationary clutter while preserving the moving target 
returns ED, ED- 

A second configuration uses phase coherent processing of the signals output by an antenna array for which each antenna 
receives spatial reflections of the same transmission at the same time. This contrasts with the above configuration where each 
antenna receives signals from different transmissions at different times. In this second approach the array is designed such that 
returns from different angles create different phase differences across the antennas m. EE). ED, ED. E). In this case, the 
covariance-based STAP approach, described above, can be applied to cancel the clutter 1(311 . US), l(23l . 

In this paper, we focus on the first (SAR GMTI) configuration and propose a covariance-based STAP algorithm with a 
customized Rronecker product covariance structure. The SAR GMTI receiver consists of an array of p phase centers (antennas) 
processing q pulses in a coherent processing interval. Define the array X !r V £ O' x q such that x'-"' 1 is the radar return from 
the jth pulse of the zth channel in the mth range bin. Let x m = vecfX 1 ''"'- 1 ). The radar data x m is complex valued and is 
assumed to have zero mean. Define 

S = Cov[x] = E[xx H ], (1) 

The training samples, denoted as the set S, used to estimate the SAR covariance X are collected from n representative 
range bins. The standard sample covariance matrix (SCM) is given by 

S=-£ x m x^. (2) 

mG*S 

If n is small, S may be rank deficient or ill-conditioned l29l . iflSTl . j2ll . [22], and it can be shown that using the SCM directly 
for STAP requires a number n of training samples that is at least twice the dimension pq of S l33l . In this data rich case, STAP 
performs well ED, ED, ED. However, with p antennas and q time samples (pulses), the dimension pq of the covariance is 
often very large, making it difficult to obtain a sufficient number of target-free training samples. This so-called “small n large 
p” problem leads to severe instability and overfitting errors, compromising STAP tracking performance. 

By introducing structure and/or sparsity into the covariance matrix, the number of parameters and the number of samples 
required to estimate them can be reduced. It has been noted JS), 1T81 . ITTI that the spatiotemporal clutter covariance X is 
low rank in general, indicating that the clutter lives in a spatiotemporal subspace of dimension r. This reduces the number of 
parameters describing the covariance matrix from 0{p 2 q 2 ) to 0(rpq). Hence, a common approach to STAP clutter cancelation 
m. ed is to estimate a low rank clutter subspace from S and use it to estimate and remove the clutter component in the data 
E), m- We call these methods Low Rank STAP (LR-STAP). Efficient algorithms, including some involving subspace tracking, 
have been proposed 0. tm Other methods adding structural constraints such as persymmetry usi. a and robustification to 
outliers either via exploitation of the SIRV model ifTTl or adaptive weighting of the training data fl5l have been proposed. Fast 
approaches based on techniques such as Krylov subspace methods lfl9l . l24l . l30l . |35) and adaptive filtering fl2) . lfj~3l exist. 
All of these techniques remain sensitive to outlier or moving target corruption of the training data, and generally still require 
large training sample sizes ]29| . In addition, to the best of our knowledge, none of these techniques explicitly incorporate the 
known spatio-temporal structure of the data into the covariance estimator. The contribution of this paper is to apply covariance 
estimation techniques designed to exploit spatio-temporal structure in order to significantly reduce the number n of training 
samples required as well as to provide a degree of robustness to corrupted training data. 

We exploit the explicit space-time arrangement of the covariance by modeling the clutter covariance matrix X c as the 
Rronecker product of two smaller matrices 

X c = A<g>B, (3) 

where A £ 0' x? ' is rank 1 and B £ C qxq is low rank. In this setting, the B matrix is the “temporal (pulse) covariance” and 
A is the “spatial (antenna) covariance,” both determined up to a multiplicative constant. 

Rronecker product covariances arise in a variety of applications, including MIMO radar 1(401 . geostatistics ESI, recommen¬ 
dation systems ID, multi-task learning ID, and genomics |42) . A rich set of algorithms and associated performance guarantees 
exist for estimation of covariances in Rronecker product form, including iterative maximum likelihood li39l . (38), noniterative 
L2 based approaches (39), sparsity promoting methods (38), l43l . and robust ML SIRV based methods (22) . Many of these 
methods have been shown to achieve significant reductions in the number of training samples required for estimation, in line 
with the reduction in the number of parameters in the Rronecker covariance model (39) . (38) . |43) . 

In this paper, an iterative L2 based algorithm is proposed to directly estimate the low rank Rronecker factors from the 
observed sample covariance. Convergence and symmetric positive semidefiniteness of the estimator is established. Theoretical 
results indicate significantly fewer training samples are required, and it is shown that the proposed approach improves robustness 
to corrupted training data. Critically, robustness allows significant numbers of moving targets to remain in the training set. We 
then introduce the Rron STAP filter, which projects away both the spatial and temporal clutter subspaces. This projects away 
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a higher dimensional subspace than does LR-STAP, thereby achieving improved noise and clutter cancelation. We note that 
this algorithm differs significantly from the set of methods known as Rron PCA, which involves modeling the covariance as 
a sum of Kronecker products ED, CD, d- 

To summarize, the main contributions of this paper are: 1) the exploitation of the inherent Kronecker product spatio- 
temporal structure of the clutter covariance; 2) the introduction of the low rank Kronecker product based Kron STAP filter; 
3) an algorithm for estimating the spatial and temporal clutter subspaces that is highly robust to outliers due to the additional 
Kronecker product structure; and 4) theoretical results demonstrating improved signal-to-interference-plus-noise-ratio; and 5) 
an extension to multipass STAP . 

The remainder of the paper is organized as follows. Section [II] presents the multichannel SIRV radar model. Our low rank 
Kronecker product covariance estimation algorithm and our proposed STAP filter are presented in Section |Hl)vith an extension 
to the case of moving target detection with multiple passes . Section [IV] gives theoretical performance guarantees and Section 
[V] gives simulation results and applies our algorithms to the Gotcha dataset. 

In this work, we denote vectors as lower case bold letters, matrices as upper case bold letters, the complex conjugate as a*, 
the matrix Hermitian as A H , and the Hadamard (elementwise) product as A 0 B. 

II. SIRV Data Model 

Let X £ C px? be an array of radar returns from an observed range bin across p channels and q pulses. We model x = vec(X) 
as a spherically invariant random vector (SIRV) with the following decomposition ED, GD, ED, ED: 

^target T ^clutter T noise — ^target T tl, (4) 

where x no j se is Gaussian sensor noise with Cov[x no j se ] = a 2 1 £ C pqxpq and we define n = x c / utter + x no j se . The signal 
of interest x target is the sum of the spatio-temporal returns from all moving objects, modeled as non-random, in the range 
bin. The return from the stationary clutter is given by x c ; utter = re where r is a random positive scalar having arbitrary 
distribution, known as the texture, and c £ C pq is a multivariate complex Gaussian distributed random vector, known as the 
speckle. We define Cov[c] = S c and note that c is determined by the arrangement of stationary scatters on the ground. The 
means of these components of x are zero. The resulting clutter plus noise (x tar get = 0 ) covariance is given by 

E = £[nn ff ] = £[t 2 ]X c + ct 2 I. (5) 

The ideal (no calibration errors) random speckle c is of the form J29|, ED 

c = l p 0 c, (6) 

where c £ C q . The representation (|G]» follows because the antenna configuration in SAR GMTI is such that each antenna receives 
signals emitted at different times at the same points in space l29l . B34| . The representation ([6} gives a clutter covariance of 

S c = 11 t 0B, (7) 

where 

B = E[cc h ], (8) 

B depends linearly on the spatial covariance function C of the clutter reflectivity, which in turn depends on the spatial 
characteristics of the clutter in the region of interest ED . While in SAR GMTI B is not exactly low rank, it is approximately 
low rank in the sense that significant energy concentration in a few principal components is observed over small regions Q. 

Due to the long integration time and high cross range resolution associated with SAR, the returns from the general class of 
moving targets are more complicated. However, if we restrict to targets having constant doppler shift / (proportional to the 
target radial velocity) within a range bin, the return has the form 

x = ad = aa(/) 0 b(/), (9) 

where a is the target’s amplitude, a(/) = [ 1 e l 2ve i(f') ... e o s P (f) ] T 5 the 9i depend on doppler shift / and the platform 
speed and antenna separation |29) , and b £ C q depends on the target, /, and its cross range path. The unit norm vector 
d = a(/) 0 b(/) is known as the steering vector. For sufficiently large 0i(f), a(/) ii l will be small and the target will lie 
outside of the SAR clutter spatial subspace. Furthermore, as observed in ED. for long integration times the return of a moving 
target is significantly different from that of uniform stationary clutter, implying that moving targets generally lie outside the 
temporal clutter subspace ED as well. 

In practice, the signals from each antenna have gain and phase calibration errors that vary slowly across angle and range (29). 
It was shown in (29) that in SAR GMTI these calibration errors can be accurately modeled as constant over small regions. Let 
the calibration error on antenna i be hie^ i and h = [ hie^ 1 , ..., h p e^ p ], giving an observed return x' = (h0l)0x 

and a clutter covariance of 

S c = (hh ff ) 0 B = A 0 B 


implying that the A in (|3]» has rank one. 
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A. Space Time Adaptive Processing 

Let the vector d be a spatio-temporal “steering vector” m. that is, a matched filter for a specific target location/motion 
profile. For a measured array output vector x define the STAP filter output y = w T x, where w is a vector of spatio-temporal 
filter coefficients. By Q and ([9]) we have 

i/=w ff x = aw I d + w fi n. (11) 


The goal of STAP is to design the filter w such that the clutter is canceled (w ff n is small) and the target signal is preserved 
(w H d is large). For a given target with spatio-temporal steering vector d, we say that the filter w is an optimal clutter 
cancellation filter if it maximizes the SINR (signal to interference plus noise ratio), defined as the ratio of the power of the 
filtered signal aw ff d to the power of the filtered clutter and noise m 


SINR out 


H 2 |w g d| 2 

Ffw-^nn^w] 


|q| 2 |w g d| 2 

w fl Ew 


( 12 ) 


where E is the clutter plus noise covariance in <|5). 

It can be shown cd, m that, if the clutter covariance is known, under the SIRV model the optimal filter for targets at 
steering vector d is given by the filter 


w = F opt d, 

(13) 

where 


F op t = S 1 . 

(14) 

Since the true covariance is unknown, we consider filters of the form 


w = Fd, 

(15) 


and use the measurements to learn an estimate of the best F. 

Traditionally, the sample covariance 0 has been used to learn the clutter covariance ESI- For ri pq the inverse of 
the sample covariance matrix can be used to reliably estimate the optimal filter ( p~4] ). Generally, in STAP n < pq and a 
regularized inverse of the sample covariance is often used as an approximation to ( fl4| ). For this a dimensionality reduction 
method called clutter subspace processing can be used, giving an alternative filter F that approximates ( fl4| ) by projecting onto a 
low dimensional subspace. This approach is effective when the clutter subspace is of low rank r <C pq. Most STAP techniques 
were developed for classical GMTI radars, for which the covariance is low rank by Brennan’s rule (8). This approach is also 
valid for SAR GMTI since the clutter covariance is also low rank no, mi. 

In clutter subspace processing a clutter subspace { u, }[ =1 is estimated using the span of the top r principal components of 
the clutter sample covariance mi, mi- The corresponding clutter cancelation filter is given by the matrix F that projects onto 
the space orthogonal to the estimated clutter subspace: 

r 

F = I-^u,uf. (16) 

i=l 

Since the sample covariance requires a relatively large number of training samples, obtaining sufficient numbers of target 
free training samples is a practical problem lf29l . |fl8l . In addition, if low amplitude moving targets are accidentally included in 
training, the sample covariance will be corrupted and partially cancel moving targets as well, which is especially problematic 
in online STAP implementations HD, 0. The STAP approach discussed below mitigates these problems as it directly takes 
advantage of the inherent space vs. time Kronecker structure of the clutter covariance E c . 


III. Kronecker STAP 

A. Kronecker Subspace Estimation 

In this section we develop a subspace estimation algorithm that accounts for spatio-temporal covariance structure and has 
low computational complexity. 

Following the approach of ED, ED, EH, E 2 , we fit the low rank Kronecker product model ( fTO} to the sample covariance 
matrix S subject to rank(A) < r a ,rank(B) < rb, where the goal is to estimate E[t 2 ]Y, c . The estimation of the parameters 
A and B in |T 0 ) is performed by minimizing the following objective function 

A, B = arg min ||S — A <g> B\\ 2 F . (17) 

rank(A)<r a ,rank(i3)<r5 

For a pq x pq matrix M define {M(*, j)}*L =1 to be its q x q block submatrices, i.e. M (i,j) = 

Also, let M = q MK Pi9 where K p g is the pq x pq permutation operator such that K Pig vec(N) = vec(N T ) for any p x q 
matrix N. 
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The invertible Pitsianis-VanLoan rearrangement operator H(-) maps ptp s x ptp s matrices to p 2 x p 2 s matrices and, as defined 
in 63, El sets the (i — 1 )p t + jth row of 7£(M) equal to vec(M(z, j)) T , i.e. 

'R.{M) = [m 1 ... m p 2 ] T , (18) 

m (i-l)pt+j = Vec(M(*,j)), M = ■ ,Pt- 

The unconstrained (i.e. r a = p. rt, = q) objective in © is shown in 63, 63, ED to be equivalent to a rearranged 
rank-one approximation problem, with a global minimizer given by 

A ® B = 7?.” 1 (ctiUiv^), (19) 


where <j\ ii| vf^ is the first singular component of 7£(S). 

When the low rank constraints are introduced, a closed-form solution of © is no longer available. An alternating 
minimization algorithm is derived in Appendix [A] and is summarized by Algorithm [I] In Algorithm [l] EIG r (M) denotes 
the matrix obtained by truncating the Hermitian matrix M to its first r principal components, i.e. 

r 

EIG r (M) := y^q-jUjuf, (20) 

2=1 


where OjUjuf’ is the eigendecomposition of M, and the (real and positive) eigenvalues er; are indexed in order of 
decreasing magnitude. The objective ( fT7) > is not convex, but since it is an alternating minimization algorithm. Algorithm 
[0 gives monotonic convergence of the objective © to a local minimum 0. In practice, we typically initialize LR-Kron with 
either EIG r<l (A), EIG rt (B) where A,B are from the unconstrained estimate ©. Monotonic convergence then guarantees 
that LR-Rron improves on this simple closed form estimator. 

We call Algorithm [T] low rank Kronecker product covariance estimation, or LR-Kron. In Appendix [A] it is shown that when 
the initialization is positive semidefinite Hermitian the LR-Rron estimator A 0 B is positive semidefinite Hermitian and is 
thus a valid covariance matrix of rank r a ri,. 


Algorithm 1 LR-Kron Covariance Estimation 


S = Sscm, form S (i,j), S 

Initialize A s.t. ||Aj|^ = 1 (or correspondingly B). 


while Objective 
Its — 


IL S — A < 

s ( i , j ) 


) B ||p not converged do 


B = EIG r 
n A = ^ 


A||| 

(Rs) 


ml 

A = EIGr a (Ra) 

end while 

return A = A, B = B. 


B. Robustness Benefits 

Besides reducing the number of parameters, Kronecker STAP enjoys several other benefits arising from associated properties 
of the estimation objective ©■ 

The clutter covariance model ( |T()| is low rank, motivating the PCA singular value thresholding approach of classical STAP. 
This approach, however, is problematic in the Kronecker case because of the way low rank Kronecker factors combine. 
Specifically, the Kronecker product A 0 B has the SVD l28l 

A0B = (U B ®U B )(S A ®S B )(Uf ®Ug) (21) 

where A = U,iS/iU , ( and B = U /> S n Uare the SVDs of A and B respectively. The singular values are Vi,j. 

As a result, a simple thresholding of singular values is not equivalent to separate thresholding of the singular values of A and 
B and hence won’t necessarily adhere to the space vs. time structure. 

For example, suppose that the set of training data is corrupted by inclusion of a sparse set of w moving targets. By the 
model <|9), the ith moving target gives a return (in the appropriate range bin) of the form 

Z i = a,;a,; 0 hi, (22) 


where a,, b, are unit norm vectors. 
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This results in a sample data covariance created from a set of observations n m with Cov[n m ] = £, corrupted by the addition 
of a set of w rank one terms 



Let s = L Em=i n m n m and T = L z,zf. Let A s ,fc be the eigenvalues of S c , A s ,min = min fc A s ,k, and let A T ,max 

be the maximum eigenvalue of T. Assume that moving targets are indeed in a subspace orthogonal to the clutter subspace. 
If T,max > O ( Xs.rmn) • performing rank r PCA on S will result in principal components of the moving target term being 
included in the “clutter” covariance estimate. 

If the targets are approximately orthogonal to each other (i.e. not coordinated), then A T,max = «, | 2 ). Since the smallest 

eigenvalue of E c is often small, this is the primary reason that classical LR-STAP is susceptible to moving targets in the training 
data El, JH. 

On the other hand, Kron-STAP is significantly more robust to such corruption. Specifically, consider the rearranged corrupted 
sample covariance: 

1 w 

1Z(S) = - 'S'' vec(ajaf / )vec(bjbf / ) ff + 1Z(S). (24) 

n 

m= 1 

This also takes the form of a desired sample covariance plus a set of rank one terms. For simplicity, we ignore the rank 
constraints in the LR-Kron estimator, in which case we have © 

A ® B = 7?.~ 1 ((TiUivf f ), (25) 

where criuiv^ is the first singular component of 1Z(S). Let <j\ be the largest singular value of 7Z(S). The largest singular value 
<ji will correspond to the moving target term only if the largest singular value of L v ec(aiaf r )vec(b,bf f ) ff is greater 

than O(cti). If the moving targets are uncoordinated, this holds if for some i, L|a ,| 2 > O(eri). Since g\ models the entire 
clutter covariance, it is on the order of the total clutter energy, i.e. er 2 = 0(Y ^ = i A 2 k ) A| min . In this sense Kron-STAP 
is much more robust to moving targets in training than is LR-STAP. 


C. Kronecker STAP Filters 

Once the low rank Kronecker clutter covariance has been estimated using Algorithm [T] it remains to identify a filter F, 
analogous to that uses the estimated Kronecker covariance model. If we restrict ourselves to subspace projection filters 
and make the common assumption that the target component in ([4]) is orthogonal to the true clutter subspace, then the optimal 
approach in terms of SINR is to project away the clutter subspace, along with any other subspaces in which targets are not 
present. If only target orthogonality to the joint spatio-temporal clutter subspace is assumed, then the optimal STAP filter is 
the projection matrix: 


^classical = I — UaU^ g> UfiUf, (26) 

where U^,Ub are orthogonal bases for the rank r a and r), subspaces of the low rank estimates of A and B, respectively, 
obtained by applying Algorithm [I] This is the Kronecker product equivalent of the standard STAP projector ( | 1 6[ i. 

Additional information is available, however. Specifically, by & no moving target should lie in the same spatial subspace 
as the clutter. We thus propose the spatial-only filter (which we call spatial-only Kron STAP) 

F spatial = (I - UaU^) <g> I. (27) 

to cancel as much of the clutter “subspace leakage” as possible while minimizing target cancelation. This leakage is due to 
noise and covariance estimation errors. Furthermore, as noted in Section [II] if the dimension of the clutter temporal subspace 
is sufficiently small relative to the dimension q of the entire temporal space, moving targets will have temporal factors (b) 
whose projection onto the clutter temporal subspace are small. Under these assumptions, it is thus near-optimal to project away 
both the temporal and spatial clutter subspaces. We thus propose a Kronecker STAP filter F kstap of the following form: 

F kstap = (I - Ua^a ) ® (I - U B Uf) = F a <g> F B . (28) 

We denote by Kron-STAP the method using LR-Kron to estimate the covariance and ( [28] ) to filter the data. Our clutter model 
has spatial factor rank r a = 1 implying that the F kstap defined in ( [28) projects the array signal x onto a (p — l)(q — rb) 
dimensional subspace. This is significantly smaller than the pq — r & dimensional subspace onto which ( |26) and unstructured 
STAP project the data. As a result, much more of the clutter that “leaks” outside the primary subspace can be canceled, thus 
allowing lower amplitude moving targets to be detected. 
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D. Multipass STAP 

In surveillance applications, it is often of interest to determine what, if anything, has changed in a scene between a reference 
time to and a later time t\, e.g. disappearance/appearance of parked vehicles, or the appearance of vehicle footprints lf29l . 
0, 0, EH. When SAR is used for such change detection applications, the radar platform will generally fly past the scene 
and form a “reference” image at time t 0 , and then at time t\ > f (l fly a path as close as possible to the original and form a 
new “mission” image. These images are then compared and changes detected. However, moving targets will almost always 
be detected as changes, along with the changes in the stationary scene background fl29l . When changes of background are of 
primary interest, moving targets may in fact mask changes in the stationary scene due to displacement and smearing. Hence, it 
is advantageous to identify moving targets in both scenes prior to or parallel to background change detection. In addition, it may 
be of interest to detect moving targets in the imagery for their own sake |f29| . We thus exploit the additional scene information 
arising from having two images to better estimate the clutter subspace, and follow STAP with subsequent noncoherent change 
detection. 

Our Kronecker STAP based change detection approach concatenates the spatial channels of both registered phase histories 
(Xfc), forming a “2 p channel phase history” 

" X, 

X 2 


X = 


G C 2pxq . 


(29) 


Since two images are involved with potentially different calibration errors, the clutter subspace is of rank 2. Thus, a rank 2 
spatial clutter subspace and a low rank temporal subspace are estimated using LRKron and projected away via the KronSTAP 
filter. This two pass procedure is easily extended to handle multiple (> 2) passes of the radar sensor. 

IV. SINR Analysis 

For a STAP filter matrix F and steering vector d, the data filter vector is © w = Fd m. With a target return of the 
form x target = ad, the filter output is given by (jTTJ, and the SINR by ( | 1 2\ . 

Define SINR max to be the optimal SINR, achieved at w opt = F opt d ( fl4| >. 

Suppose that the clutter has covariance of the form ( flQ| ). Assume that the target steering vector d lies outside both the 
temporal and spatial clutter subspaces as per above and lfl8ll . Suppose that LR-STAP is set to use r principal components. 
Suppose further that Kron STAP uses 1 spatial principal component and r temporal components, so that the total number of 
principal components of LR-STAP and Kron STAP are equivalent. 

Under these assumptions, if a approaches zero the SINR achieved using LR-STAP, Kron STAP or spatial Kron STAP with 
infinite training samples achieves iTTSIl S IN H , 

We analyze the asymptotic convergence rates under the finite sample regime. Define the SINR Loss p as the loss of 
performance when using the estimate w = Fd (corresponding to SINR,,,,*) as the filter instead of w opt : 

SINR out 
P = 


SINR mox 

Let Aj, i = 1,... ,pq be the eigenvalues of £ c . Under the Kronecker model, we have 


A,; = 




0 


- 


i = l,--.,r b 
i > r b 


(30) 


(31) 


since A only has one nonzero singular value. 

Theorem IV.l (LR-STAP SINR lll8h . For large n, the expected SINR Loss of LR-STAP is 


E[p] = l--Y. 


lv-/ E[T 2 ]Xi + (T 5 


i=l 


which in the small a 2 regime (typical in SAR M81 ) becomes 


Under the Kronecker model we have 


E[t 2 } A* 




E\p\* 1-- 


1 _ / E[t 2 ]s% + 5 )' 


i=l 


E[ 7 


NO 


(32) 


(33) 


(34) 


We now turn to Kron STAP. Note that the Kron STAP filter can be decomposed into a spatial stage (filtering by F Hpaflo i) 
and a temporal stage (filtering by F ternp ): 


F KSTAP = F^4 (g) Fs = F spatialFtemp 


( 35 ) 







where F spa tiai = F 4 0 1 and F temp - 1 0 F/> ( |28[ ). Under the idealized model in this section, either the spatial or the temporal 
stage is sufficient to project away the clutter subspace. We assume the naive estimator 


A = EIGi 



= fthh H 


(36) 


for the spatial subspace h (||h ||2 = 1)- This is equivalent to approximating the sample spatial covariance as rank 1. The analysis 
of lfl 8 l thus applies with r = 1 and n' = nq, except some of the samples are correlated. Using the Rronecker structure of the 
covariance it is trivial to show (for the SIRV distribution) that the worst case occurs when all the clutter temporal correlations 
are all ±1, in which case 3 JA S(i, i) reduces to an n iid sample SCM with Gaussian noise variance a 2 /q and we can directly 


obtain the following via Theorem IV. 1 


Theorem IV.2 (Kron STAP SINR). For large n and using the estimator ( |36 [ >, the expected SINR Loss of Kron STAP using 
the estimator ([36} for the spatial subspace satisfies 


E\p\ > 1 


1 ( E [ t 2 W +<^ \ 2 
n y E[T 2 ]tp J 


(37) 


/ / (1) trace(B) 

where = s A —^ 

In the small a 2 regime this becomes 

E[p] > 1 - 1 . (38) 

n 


Since by 0 r < q, the gains of using Kron STAP can be quite significant. 

Finally we consider the case where errors occurred in estimating the spatial covariance, either due to subspace estimation 
error or to A having a rank greater than one, e.g., due to small calibration errors. Specifically, suppose the estimated (rank 
one) spatial subspace is h, giving a Kron STAP spatial filter F spa tiai = (I — lih^) 0 1. Suppose further that spatial filtering of 
the data is followed by the temporal filter F temp based on the temporal subspace Us estimated from the training data. Define 
the SINR loss pt |h from using an estimate of U/> as 


SINR OMt 

SINR ma ,(h) 


(39) 


where SINR mox (h) is the maximum achievable SINR given that the spatial filter is fixed at F spatial = (I — hh // ) 0 I. Then 
it is shown in Appendix [B] that the expected SINR Loss of the temporal Kron STAP stage is given by the following theorem. 

Theorem IV.3 (Kron STAP (temporal stage) SINR). Suppose that a value for the spatial subspace estimate h (with || h|| 2 = 1) 
and hence F spatial is fixed. Let the steering vector for a constant Doppler target be d = ®dg per |9}, and suppose that 
d^ is fixed and d b is arbitrary. Then for large n 


E[p t \h] = 

1 - - V 

n < ^ 


a '(E[r02> + I n^)(E0] S g> + -^)' 


i— 1 


(E[t 2 ]s%) 


K = 


d|Ac Ia 

h H Ah 


In the small a 2 regime this becomes 


E[p t |h]«l-^. 


(40) 


(41) 


Note that in the n p regime relevant when 5 > p, h « h, where h is the first singular vector of A. This gives 
h u Ah « anc j K ^ () jf A is indeed rank one. Hence, k can be interpreted as quantifying the adverse effect of mismatch 
between A and its estimate. To avoid cancelation of the moving targets, it is necessary that rt, <C q, and since in the ideal 
large sample regime all the clutter is removed by the temporal stage, can be smaller than rank(B). Hence this slower SINR 
convergence rate in n on a smaller amount of cancelation than the spatial stage (since n should be small) is still faster than 
that of LR-STAP in general. 
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V. Numerical Results 


A. Dataset 

For evaluation of the proposed Rron STAP methods, we use measured data from the 2006 Gotcha SAR GMTI sensor 
collection ll34l . This dataset consists of SAR passes through a circular path around a small scene containing various moving 
and stationary civilian vehicles. The example images shown in the figures are formed using the backprojection algorithm with 
Blackman-Harris windowing as in (29). For our experiments, we use 31 seconds of data, divided into 1 second (2171 pulse) 
coherent integration intervals. 

As there is no ground truth for all targets in the Gotcha imagery, target detection performance cannot be objectively quantified 
by ROC curves. We rely on non ROC measures of performance for the measured data, and use synthetically generated data 
to show ROC performance gains. In several experiments we do make reference to several higher amplitude example targets in 
the Gotcha dataset. These were selected by comparing and analyzing the results of the best detection methods available. 


B. Simulations 


We generated synthetic clutter plus additive noise samples having a low rank Kronecker product covariance. The covariance 
we use to generate the synthetic clutter via the SIRV model was learned from a set of example range bins extracted from the 
Gotcha dataset, letting the SIRV scale parameter r 2 in Q follow a chi-square distribution. We use p = 3, q = 150, rj, = 20, 
and r a = 1, and generate both n training samples and a set of testing samples. The rank of the left Kronecker factor A, r a , 
is 1 as dictated by the spatially invariant antenna calibration assumption and we chose r f, = 20 based on a scree plot, i.e., 
20 was the location of the knee of the spectrum of B. Spatio-temporal Kron-STAP, Spatial-only Kron-STAP, and LR-STAP 
were then used to learn clutter cancelation filters from the training clutter data. The learned filters were then applied to testing 
clutter data, the mean squared value (MS Residual) of the resulting residual (i.e. (1/M) l ||Fx m || 2 ) was computed, and 
the result is shown in Figure [T| as a function of n. The results illustrate the much slower convergence rate of unstructured 
LR-STAP. as compared to the proposed Kron STAP, which converges after n = 1 sample. The mean squared residual does not 
go to zero with increasing training sample size because of the additive noise floor. 

To explore the effect of model mismatch due to spatially variant antenna calibration errors ( r a > 1), we simulated data with 
a clutter spatial covariance A having rank 2 with non-zero eigenvalues equal to 1 and 1/30 2 . The STAP algorithms remain the 
same with r a = 1, and synthetic range bins containing both clutter and a moving target are used in testing the effect of this 
model mismatch on the STAP algorithms. The STAP filter response, maximized over all possible steering vectors, is used as 
the detection statistic. The AUC of the associated ROC curves is plotted on the left in Figure [2] as a function of the number of 
training samples. Note again the poor performance and slow convergence of LR-STAP, and that spatio-temporal Kron-STAP 
converges very quickly to the optimal spatial Kron-STAP performance, and more slowly converges to a superior performance 
as the temporal filter estimate converges. 

Finally, we repeat the AUC vs. sample complexity experiment of the previous paragraph with 5% of the training data 
having synthetic moving targets with random Doppler shifts. The results are shown in Figure [3] As predicted by the theory 
in Subsection III-B| the Kronecker methods remain largely unaffected by the presence of cormpting targets in the training 
data, whereas significant losses are sustained by LR-STAP. This confirms the superior robustness of the proposed Kronecker 
structured covariance in our Kron STAP method. 




Fig. 1. Average mean squared residual (MSR), as a function of the number of training samples, of noisy synthetic clutter filtered by spatio-temporal Kron 
STAP, spatial only Kron STAP, and unstructured LR-STAP (SCM STAP) filters. On the right a zoomed in view of a Kron STAP curve is shown. Note the 
rapid convergence and low MSE of the Kronecker methods. 
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Fig. 2. Area-under-the-curve (AUC) for the ROC associated with detecting a synthetic target using the steering vector with the largest return, when slight 
spatial nonidealities exist in the true clutter covariance. Note the rapid convergence of the Kronecker methods as a function of the number of training samples, 
and the superior performance of spatio-temporal Kron STAP to spatial-only Kron STAP when the target’s steering vector d is unknown. 



Fig. 3. Robustness to corrupted training data: AUCs for detecting a synthetic target using the maximum steering vector when (in addition to the spatial 
nonidealities) 5% of the training range bins contain targets with random location and velocity in addition to clutter. Note that relative to Figure [2] LR-STAP 
has degraded significantly, whereas the Kronecker methods have not. 


C. Gotcha Experimental Data 

In this subsection, STAP is applied to the Gotcha dataset. For each range bin we construct steering vectors dj corresponding 
to 150 cross range pixels. In single antenna SAR imagery, each cross range pixel is a Doppler frequency bin that corresponds 
to the cross range location for a stationary target visible at that SAR Doppler frequency, possibly complemented by a moving 
target that appears in the same bin. Let D be the matrix of steering vectors for all 150 Doppler (cross range) bins in each 
range bin. Then the SAR images at each antenna are given by x = 1(g) D^x and the STAP output for a spatial steering vector 
h and temporal steering (separable as noted in ([9]>) is the scalar 

2 /i(h) = (h ( 8 ) di ) H Fx (42) 

Due to their high dimensionality, plots for all values of h and i cannot be shown. Hence, for interpretability we produce images 
where for each range bin the /th pixel is set as maxh y,.(h) |. More sophisticated detection techniques could invoke priors on 
h, but we leave this for future work. 

Shown in Figure[7]are results for several examplar SAR frames, showing for each example the original SAR (single antenna) 
image, the results of spatio-temporal Kronecker STAP, the results of Kronecker STAP with spatial filter only, the amount of 
enhancement (smoothed dB difference between STAP image and original) at each pixel of the spatial only Kronecker STAP, 
standard unstructured STAP with r = 25 (similar rank to Kronecker covariance estimate), and standard unstructured STAP 
with r = 40. Note the significantly improved contrast of Kronecker STAP relative to the unstructured methods between moving 
targets (high amplitude moving targets marked in red in the figure) and the background. Additionally, note that both spatial 
and temporal filtering achieve significant gains. Due to the lower dimensionality, LR-STAP achieves its best performance for 
the image with fewer pulses, but still remains inferior to the Kronecker methods. 

To analyze convergence behavior, a Monte Carlo simulation was conducted where random subsets of the (bright object 
free) available training set were used to learn the covariance and the corresponding STAP filters. The filters were then used 
on each of the 31 1-second SAR imaging intervals and the MSE between the results and the STAP results learned using the 
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entire training set were computed (Figure [4}. Note the rapid convergence of the Kronecker methods relative to the SCM based 
method, as expected. 




Fig. 4. Gotcha dataset. Left: Average RMSE of the output of the Kronecker, spatial only Kronecker, and unstructured STAP filters relative to each method’s 
maximum training sample output. Note the rapid convergence and low RMSE of the Kronecker methods. Right: Normalized ratio of the RMS magnitude 
of the brightest pixels in each target relative to the RMS value of the background, for the output of each of Kronecker STAP, spatial Kronecker STAP, and 
unstructured STAP. 


Figure [4] (right) shows the normalized ratio of the RMS magnitude of the 10 brightest filter outputs y,(h) for each ground 
truthed target to the RMS value of the background, computed for each of the STAP methods as a function of the number of 
training samples. This measure is large when the contrast of the target to the background is high. The Kronecker methods 
clearly outperform LR-STAP. 


D. Multipass Kron STAP 

Representative two pass Kronecker STAP results are shown in Figure [5] comparing to two pass LR-STAP and to standard 
(gain calibrated) incoherent change detection. For the STAP methods, noncoherent change detection is performed following 
filtering by reforming each image (via maximum steering vectors as in the previous section) and subtracting the resulting pixel 
magnitudes. It can be seen that additional clutter cancelation capabilities can be gained by using Kronecker STAP on multiple 
passes. 

As in the single pass case. Figure [6] shows relative RMSE convergence results and the normalized RMS ratio between targets 
and background. Again, Kron STAP outperforms the other methods, and both STAP methods outperform standard incoherent 
change detection. 


VI. Conclusion 

In this paper, we proposed a new method for clutter rejection in high resolution multiple antenna synthetic aperture radar 
systems with the objective of detecting moving targets. Stationary clutter signals in multichannel single-pass radar were shown 
to have Kronecker product structure where the spatial factor is rank one and the temporal factor is low rank. Exploitation 
of this structure was achieved using the Low Rank KronPCA covariance estimation algorithm, and a new clutter cancelation 
filter exploiting the space-time separability of the covariance was proposed. The resulting clutter covariance estimates were 
applied to STAP clutter cancelation, exhibiting significant detection performance gains relative to existing low rank covariance 
estimation techniques. As compared to standard unstructured low rank STAP methods, the proposed Kronecker STAP method 
reduces the number of required training samples and enhances the robustness to corrupted training data. These performance 
gains were analytically characterized using a SIRV based analysis and experimentally confirmed using simulations and the 
Gotcha SAR GMTI dataset. 


Appendix A 

Derivation of Algorithm Q] 


We have the following objective function: 


min 11S — A (g) B11 p 

rank( A) =r a ,rank(B) =ri 


( 43 ) 
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R ef erenc e Imag e Missi on Imag e 



Fig. 5. Multipass STAR An example reference and mission image pair are shown, both of which include moving targets. Shown are the results of incoherent 
change detection, multipass spatio-temporal Kron STAP, multipass LR-STAP, and the multipass spatial Kron STAP enhancement. Note the superior moving 
target enhancement of the Kronecker methods. 




Fig. 6. Left: Average RMSE of the output of the Kronecker and unstructured STAP filters and incoherent change detection relative to each method’s maximum 
training sample output. Note the rapid convergence and low RMSE of the Kronecker methods. Right: Normalized ratio of the RMS magnitude of the brightest 
pixels in each target relative to the RMS value of the background, for the output of each of Kronecker STAP, incoherent change detection, and unstructured 
STAP. 


To derive the alternating minimization algorithm, fix B (symmetric) and minimize ( [43] ) over low rank A: 


arg mm 

rank(A)=r a 


|S- A® B|||. 


= arg min V ||S(i, j) - b i;j A| 

rank(A )=r a — 


= arg min ^ IM 2 ||A||| - 2Re[6„- (A, S *(i,j))] 

rank(A)=r 0 — 

i,3 


= arg min ||A|||i — 2Re 

rank(A)=r a 


E ljhjS*(i,j)' 




= arg mm 

rank(A)=r a 


A — 


E lj b ij s (hj) 


mi 


(44) 
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where bij is the i,j th element of B and b* denotes the complex conjugate of b. This last minimization problem ( |44| ) can be 
solved by the SVD via the Eckart-Young theorem Do). First define 


R/i — 




l 3 


S (i,j) 


B 


(45) 


and let uf,af be the eigendecomposition of R 4 . The eigenvalues are real and positive because R 4 is positive semidefinite 
(psd) Hermitian if B is psd Hermitian [39]. Hence by Eckardt-Young the minimizer of the objective (|44|) is 


A(B) = EIG r . a (R y i) = 

1=1 

Similarly, minimizing ( [43] ) over B with fixed positive semidefinite Hermitian A gives 

B(A) =EIG rb (R s ) = X>fuf(uf) ff , 

i=l 


where now uf , af 1 describes the eigendecomposition of 


Rb = 




II 


(46) 


(47) 


(48) 


Iterating between computing A(B) and B(A) completes the alternating minimization algorithm. 

By induction, initializing with either a psd Hermitian A or B and iterating until convergence will result in an estimate 
A ® B of the covariance that is psd Hermitian since the set of positive semidefinite Hermitian matrices is closed. 


Appendix B 

Proof of Theorem IIV.3I 

After the spatial stage of Kron STAP projects away ( |27] > the estimated spatial subspace h (where |jh || 2 = 1) the remaining 
clutter has a covariance given by 

((I — hh H )A(I — hh ff )) ® B. (49) 

By ((9], the steering vector for a (constant Doppler) moving target is of the form d = d^ 0 d B . Hence, the filtered output is 

y = w H x = d H Fx (50) 

= (df 0 df )(Fa ® F s )x 
= ((dfF A )®(dfF B ))x 
= dfF s ((d^ (l-hh ff ))®l)x 

Let d 4 = (I — hh^jd^ and define c = ^d^ ® ij c. Then 

2 / = dfF B (Yc + n), (51) 

where n = (d .4 0 I)n and 

Cov[c] =(d^Ad, 4 )B (52) 

Cov[n] =cr 2 I, 

which are proportional to B and I respectively. The scalar (d^Adyi) is small if A is accurately estimated, hence improving 
the SINR but not affecting the SINR loss. Thus, the temporal stage of Kron STAP is equivalent to single channel LR-STAP 
with clutter covariance (d^ Ad^)B and noise variance a 2 . 

Given a fixed A = hh H , Algorithm [l] dictates ( |48j ), ( |47j ) that 

p 

Rb = ^ h*h*S(i,j) (53) 

ij 

B = EIG rfc (R B ), 

which is thus the low rank approximation of the sample covariance of 

x/j = x c>h + n h = (h <g> I) H (x c + n). 


(54) 
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Since x c = rc, x c ^ = r(h (8) I) - " c is an SIRV (Gaussian random vector (h (8) T) H c scaled by r) with 


Cov[x C: /j] = r 2 (h"Ah)B 


( 55 ) 


Furthermore, = (h® I) ff n which is Gaussian with covariance cr 2 I. Thus, in both training and filtering the temporal stage 
of Kron STAP is exactly equivale nt to single channel LR STAR Thus to prove Theorem |I V.3 1 it is straightforward to apply the 
analysis in the proof of Theorem IV. 1 mt with the noise variance in training effectively being k = times the noise 

variance in testing. 
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Fig. 7. Four example radar images from the Gotcha dataset along with associated STAP results. The lower right example uses 526 pulses, the remaining three 
use 2171 pulses. Several moving targets are highlighted in red in the spatial Kronecker enhancement plots. Note the superiority of the Kronecker methods. 
Used Gotcha dataset “mission” pass, starting times: upper left, 53 sec.; upper right, 69 sec.; lower left, 72 sec.; lower right 57.25 sec. 
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